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Capacity Requirement of a Mail Sorting Device: II 1 

A. J. Goldman 

(July 14, 1959) 

The combinatorial analysis of a mathematical model of a sorting device suggested by 
S. Henig is completed. The relevant parameters are r, the number of destinations for the 
mail, and A', the number of letters entering the device during each cycle of operation. The 
capacity of the device, if it is never to jam, should be between rk and rk — (r— 1) inclusive; 
arguments indicating that the latter value is preferable are given. 



1. Introduction 

We deal with a highly idealized mathematical 
model of a mail sorting device suggested by S. Henig 
(NBS Electronic Instrumentation Section). For our 
purposes the operation of the device can be described 2 
as follows. Mail (to any of r destinations) enters 
the system. After k letters have entered, the device 
"asks itself" to which destination it contains the 
most letters. 3 All letters to this predominant destina- 
tion are then dropped out of the device, another k 
letters enter, and the process continues. The device 
is said to jam if, after a dropout, it is still so "full" 
that entrance of the next k letters would cause an 
"overflow." 

Investigation of the most appropriate capacity for 
such a device leads to mathematically interesting 
problems of two general types. If the capacity is 
to be so chosen that jamming never occurs, then the 
problems are essentially combinatorial in nature and 
can be treated mainly by "counting" methods. If, 
however, the object is the more modest one of keep- 
ing the frequency of jamming down to some specified 
"tolerable level," then rather difficult probabilistic 
problems arise ; this is due to the fact 4 that the 
behavior of the device constitutes a Markov chain 
with a great many states, governed by the probabilis- 
tic distribution of mail by destinations. 

The present paper completes an earlier analysis 6 
of the combinatorial questions, which showed that 



(r-l)(k—l)+k=rk-(r—l) 



(1) 



is a possible capacity for the device, and (if the device 
begins operation empty) is in fact the minimum 
capacity. This left open the possibility of gaining 
some advantage by using a greater capacity than 
that given in (1) and beginning operation with the 
device partly or entirely filled. It will be proved 
below that no capacity in excess of rk should be 
employed, and that any advantage arising from a 



i Pari of a project sponsored by the Post Office Department, Office of Research 
and Development. 

2 The physical device can also operate under dropout rules other than the one 
described below. 

A rule for breaking "ties'' between destinations is also required. 

4 Pointed out (unpublished memorandum, July 1956) by J. R. Rosenblatt 
(NBS statistical Engineering Laboratory). 

* B. K. Bender and A. J. Goldman, Capacity requirement of a mail sorting 
device, J. Research XBS 62, 171 (1959) RP2948. 



capacity greater than rk—(r—l) would be only 
transient. 

The author acknowledges several improvements in 
the exposition suggested by J. R. Rosenblatt. Un- 
fortunately a somewhat more formal treatment than 
that of the previous paper seemed required by the 
greater complexity of the analysis below. 

2. Statement of Results 

In order to describe our results, it is convenient to 
define 

x(t) = number of letters in the device just (2) 
before the tth dropout, 

and also to give a precise definition of ' 'capacity/ ' 

A non-negative integer C will be called a capacity 
for the device if, should the device begin operation 
containing no more than (J letters, it can never sub- 
sequently contain more than C letters under the 
dropout rule described above; symbolically, 

x(t )<C implies x(t)<C for all *>* . (3) 

This condition is clearly necessary to insure that the 
device never jams. 

Theorem 1. C is a capacity if and only if 

rk—(r—l)<C. 

A capacity C will be called efficient if there is at 
least one set of initial contents for the device, with 
no more than C letters, such that the device might 
possibly contain C letters again at some later time; 
symbolically, for some set of initial contents, 

x(t Q )<C and x(t) maxvo * s =C for t>t Q . (4) 

If this condition is not satisfied, then the full capac- 
ity C of the device will not actually be needed after 
the start of operation, a wasteful situation. 

Theorem 2. C is an efficient capacity if and only 

¥ 

rk-(r-\)<C<rk. 

A capacity C will be called essentially efficient 6 if 
there is at least one set of initial contents with no 
more than C letters such that the device with non- 
zero probability will contain C letters infinitely 

6 Use of the term "essential" here, and of the subscript "ess" in (5), was sug- 
gested by a roughly similar usage in measure theory. 



79 



often 7 ; symbolically, for some set of initial contents, 

x(fo)<C and x(t) m&xe88 =C for t>t Q . (5) 

This definition is strengthened to that of a uniformly 
essentially efficient capacity by requiring that the 
stated condition hold for all initial contents of no 
more than C letters; i.e., that 



Theorem 3. The only essentially efficient capac- 
ity is rk—(r—l) = C; it is also uniformly essentially 
efficient. 

The proofs of these theorems are given in section 
3. Section 4 contains a very crude probabilistic 
estimate related to theorem 3. 

3. Proofs 

3.0. Preliminaries 

It is helpful to define 



M(t)=m&xxi(t) 

i 



where 



Xi(t)= number of letters to the ith destina- 
tion in the device just before the tth 
dropout. 

Lemma 1. M(t)^>k implies x(t+l)<Cx(t). 
Lemma 2. x(t)^>rk implies x(t+l)<^x(t). 
Lemma 3. M(t)=k implies x(t J rl)=x(t). 
Lemma 4. M(t)>k implies x{t+\)<x(t). 
Lemma 5. x(t)>rk—(r—l) implies x(t+l) <x(t). 
Lemma 6. x(t )<rk—(r—l) implies 

x(t)<rk—(r—l) for all t>t . 
Lemma 7. x(t )>rk—(r—l) implies 

x(f) <x(t ) for all t>t . 

Lemma 1 is proved by observing that if M(f)^>k, 
then more than k letters leave the device in the 
tth dropout, whereas only k letters enter before the 
(£+l)-st dropout. Since there are only r destina- 
tions, x(t)y>rk implies M(t)^>k, and so lemma 2 
follows from lemma 1 . Lemma 3 is an obvious con- 
sequence of the dropout rule, and lemma 4 is obtained 
by combining lemmas 1 and 3. 

To prove lemma 5, note that since there are only 
r destinations, 

x{t) >rk— (r— 1) >r{k— 1) implies M(t) >k—l, 

so that lemma 4 can be applied. Lemma 6 merely 
states the fact 5 that rk—(r—l) is a capacity. In 



proving lemma 7, two possibilities must oe con- 
sidered. If, on the one hand, 

x(t) >rk—(r—l) for all t>t , 

then by lemma 5, x(t)<x(t ) for all t>t . If, on the 
other hand, there is a ti^>t such that 



x(to)<C implies x(t) m&x eBa = C for t>t . (6) but 



x(t)>rk-(r—l) for * <K*i 
xft)<rfc-(r-l), 



then by lemma 5, x(t) <x(t Q ) for t <t<Ct u and by 
lemma 6 

x(t)<rk-(r—l)<x(t ) for all t>t h 
so that again 

x(t)<x(t ) for all t>t . 

3.1. Proof of Theorem 1 

To prove the "only if" statement, suppose 
C<jrk— (r— 1) and x(t )<C. If, on the one hand, 
x(t ) = 0, then (see op. cit. footnote 5, p. 79) there is 
a sequence of possible events leading to a situation in 
which 



x (t)=rk— (r— 1)>C; 



*<*b+(r-l)(fc-l) + 2. 



This violates condition (3) and so C is not a capacity. 
If, on the other hand, x(/ )>0, then a sequence of 
possible events can be constructed leading to a 
situation in which 



sfa)=0; U<U+C. 



(7) 



7 If this condition is not satisfied, then with probability one the device will, 
from some point on, fail to use its full capacity. 



To do this, choose the k letters entering the device 
between the tth and (£-fl)-st dropouts, for 
to<t<^t u to consist entirely of letters to some one 
destination for which £ Z (Y)>0. Then M(t)^>k for 
to<t<Cti, and so by lemma 1 the number of letters in 
the device steadily decreases until condition (7) is 
satisfied. From this point we may argue as in the 
case x(t )=0 (now using t u instead of t ), again ob- 
taining the conclusion that C is not a capacity. 
To prove the "if" statement, suppose 

(l>rk-{r-\) and x(t )<C. 

If x(t ) <rk—(r—l)j then by lemma 6 

x{t)<rk-{r-l)<C for all t>t , 

while if x(to) >rk—(r—l), then by lemma 7 

x(t) <x(t ) <C for all t>t ; 

in either case condition (3) is satisfied, and so C is a 
capacity. 
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3.2. Proof of Theorem 2 

To prove the "if" statement, suppose 

rk-(r— l)<C<rk. 

The initial contents of the device can clearly be 
chosen so that x(t ) = C and M(t )=k. Among the 
destinations with Xi(t ) = k, select the one "preferred" 
by the tie-breaking rule at the t -th dropout; let k 
letters to this destination enter the device between 
the U-th and (£ -f-l)-st dropouts. Then M(t +1) 
= k; among the destinations with x i (t J rl)=k, select 
the one preferred by the tie-breaking rule at the 
(£o+l)-st dropout, and let k letters to this desti- 
nation enter the device between the (£ +l)-st and 
(£ +2)-nd dropouts, etc. This construction results 
in M{t)=k for all t>t , so that, by lemma 3, 

x(t)=x(t ) = C for all t>t . 

Hence C is an efficient capacity. 

The "only if" statement will be proved next. By 
theorem 1, no C<Crk—(r—l) can be a capacity; thus 
no such C can be an efficient capacity. Suppose 
C^>rk and x(t Q ) <C. There are three possibilities to 
be considered. If x(t ) <rk— (r— 1), then by lemma 6 

z(t)<rk—(r-l)<rk<C for all t>t 

so that C is not an efficient capacity. If 

rk-(r-lXx(t )<rk, 

then by lemma 7 

x(t) <x(h) <rk<C for all t>t , 

so that C is not an efficient capacity. Finally, if 
x(t )y>rk, then by lemma 2 there is a ti^>t Q (with 
ti<to+C—rk) such that 



and 



rk<x{t) <x(U) < C for * <K*i 
x(U) <rk; 



the arguments for the first two possibilities can be 
applied to t x to yield 

x(t) <x(t x ) <rk<C for all t>t x 

so that x(t)<CC for all t^>t and again C is not an 
efficient capacity. 

3.3. Proof of Theorem 3 

First, rk—ir— 1) is a uniformly essentially efficient- 
capacity. To see this, note that the construction 
used above to prove the "only if" part of theorem 1 
shows that if 

x(t )<rk-(r-l) 



then with probability one 8 there is a ti^>t for which 

xfe)=rA:-(7— 1); 

by the very same argument, with probability one 
there is a £ 2 >^i for which 

x(t 2 )=rk— (r— 1), 

and so on. In fact, for rk—(r — 1) the phrase "with 
nonzero probability" in the definition of "uniformly 
essentially efficient capacity" could actually have 
been strengthened to "with probability one." 

Second, no C^>rk—(r—l) is an essentially efficient 
capacity. To prove this, observe that by theorem 2 
attention can be confined to the case 

rk-{r-\)<iC<rk. 

Suppose x(t Q ) <C. If 

x(ti)<rk—(r—l) for some U>U, 
then by lemma 6 

x(t)<rk-(r-l)<C for all t>t u 

so that the device cannot contain C letters infinitely 
often. Thus we may assume that 

x(t)>rk-(r-l) for all t>t , (8) 

so that, by lemma 5, 

x(t+l)<x(t) for all t>t . (9) 

Ifx(to)<C, then by (9) 

x(t)<x(to)<C for all t>t , 

so that the device can never contain C letters. 
Therefore assume that x(i) =C. If 

M(U) > k for some U > t , 

then by lemma 1 and (9) 

x(t)<x(t l + l)<x{t l )<x{U) = C for all C^ 1; 

so that the device cannot contain C letters infinitely 
often. Therefore assume 

M(t)<k for all t>t . 

From (8) and the fact that there are only r destina- 
tions it follows that M(t) >k, and so M{t)=k. Thus 
the only possibility for the device to contain C 
letters infinitely often is given by 

z(f) = C, M(t)=k for all £>* ; 

this clearly has probability zero. 

8 The construction shows that if x(to)<rk— (r— 1), then for any block of 
[(r-l)(fc— D+2-j-r/c— (r— D] values of t>h, there is a nonzero probability (inde- 
pendent of the particular block) that x(t)=rk— (r-1) at least once. 
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4. A Probabilistic Estimate 



Since 



Suppose C is an efficient capacity which is not 
essentially efficient; i.e. (according to theorems 2 
and 3), 

rk—(r—l)<C<rk. 

We know that with probability one, there is a 
ti>t such that 

x(t)<C for all t>U. 

It is of interest to estimate how early the first such U 
(the last moment at which the full capacity of the 
device is used) might occur. In this primarily 
combinatorial paper, we will be content with an 
exceedingly crude probabilistic estimate. 
Denote by E the event 

x(fo) = C and M(t )=Jc. 

By the proof of theorem 3, ti=t unless E occurs. 
Therefore the function of t x to be estimated is 

P(f 1 ) = Y T oh{x(t) = for some t>ti\E} } (10) 

where Prob {A \B] denotes the conditional probability 
of A given B. 

Let the relative frequencies (or probabilities) of 
letters to the different destinations be denoted by 



let 
and let 



0</i</ 2 < . . . <fr, 
C=r(k-l)+s + l; 0<s<r, 
F s =f l +f 2 + . . . +/.- 
It will first be shown that, for any t^>t , 
J?rob{x(t) = C, M(t)=k\x(l—l) = C, M(t-l)=k 

Suppose 



x(t—l) = C 9 



<(l-F s ) k (11) 
M(t-l)=k. 



x(t- 1) = C>r(k- 1) +s= (r-s) (k- l)+sk, 

it follows that Xi(t—l) = k for at least s-\-\ destina- 
tions. The letters to some one of these destinations 
leave the device in the (t— l)-th dropout, and 
M(t)y>k if any of the k letters which next enter the 
device go to any of the remaining 5 destinations 
with k letters each. Since the sum of any s fre- 
quencies ji is at least F s , the inequality (11) holds. 
By the proof of theorem 3, for any one £>£p, 
both x(t— 1) = C and M(t—l)=k hold if and only if 

x(t') = C and M(t')=k for t <t'<t. 

Thus the probability on the left side of (11) can be 
rewritten 

Frob{x(t) = C, M(t)=k\x(t') = C, M(t')=k for 

t Q <t'<t}, 

so that the inequality (11) yields 

UFrob{x(t) = C, M(t)=k\x(t') = C, M(t')=kt Q <t<t 1 

for f < t'<t}<(l-F s )Wi-toK 

This last product is equal to 

Fvob{x(t) = C, M(t)=k for t <t<ti\E}, 

which, according to the proof of theorem 3, is an 
upper bound for P(ti) and so we have proved that 



P(ti)<(l-F,)Wr 



(12) 



If for example k = 2 and F 9 = 0.001, then after 500 
dropouts the probability that the device will make 
any further use of its full capacity is less than 0.50. 



Washington, D.C. 



(Paper 63B2-10) 
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